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(57) Abstract 

A coding unit (420) 
contains a syntax-based 
adaptive arithmetic coder 
layer (422) which select 
arithmetic coding models 
and arithmetic coding 
layer (424) which peiforms 
arithmetic coding using the 
coding models. The coding 
unit implements variable 
length coding which encodes 
a string of symbol values 
using arithmetic coding 
models selected according to 
Che syntax of the string. The 
arithmetic coding models are 
optimized for each separate 
symbol in the string to 
provide efficient coding that 
provides a shorter average 
code length than is provided 
with arithmetic coding . 
using a single mode). In 
an embodiment for moving 
image coding, two sets of 
arithmetic coding models, 
one for intra frames and 
one for inter frames, are 
used for a string of symbols 
repitscmlng DCT blocks. 

Thr model used for a symbol depends of the symbols order in the string. A decoding unit (450) contains an arithmetic decoding layer 
(454) which identifies symbol values from a coded bitstream and a syntax-based adaptive arithmetic coder layer (452) which selects an 
arithmetic coding model for the decoding layer based on the syntax of the string being decoded. 
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SYNTAX-BASED ARITHMETIC CODING FOR 
LOW BIT RATE VIDEOPHONE 

BACKGROUND OP THE INVENTION 

5 Field of the Inventiion 

This invention relates to low-bit rate digital 
transmission of moving images such as in videophone 
communications over conventional phone lines. 

10 Descript ion of Related Art 

PSTN Phone lines are commonly used for digital 
communications between computers or other devices using 
modems and standard communications protocols. Such 
communication protocols have bit rates limited by the 

15 quality of transmission over the phone lines. For 

example, tlie V.PAST standard has a bit rate of between 
16-8 and 28.8 kbit/s depending on the quality of the 
connection. These bit rates are low when compared to 
the requirements for transmitting high quality digital 

20 moving images. 

Conventional digital moving images are a 
succession of frames (or still images) which are 
commonly represented by two-dimensional arrays of 
pixels values. The pixel values indicate colors or 

25 intensities of pixels in the frames. Transmission of 
pixel values by videophones is impractical because of 
the large amount of data required to transmit every 
pixel value in every frame of a moving picture. 
Accordingly, typical videophone systems contain an 

30 encoding circuit which converts a succession of frames 
(two-dimensional arrays) into a compressed 
representation of the moving image. 

Fig. 1 shows a video encoding portion of a 
videophone which generates a compressed representation 
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of a moving image. An input video signal VIDEO_IN from 
a video camera or other video source indicates pixel 
values in frames of the moving image. An information 
extracting circuit 110 converts the representation of 
5 the moving image to a format where redundant 

information is more easily removed. For example, 
difference- frame coding subtracts pixel values from a 
current frame from pixel values in a preceding frame 
and extracts non-zero values which indicate changes 

10 between the frames. Redundant data, that are repeated 
in successive frames, appear as zero values in the 
difference, and the large number of zero values can be 
effectively compressed or removed. 

Motion estimation reduces the number of non-zero 

15 values in the difference between frames by subtracting 
a block having a position indicated by a motion vector 
in a preceding frame from a block in a current frame. 
The motion vector is selected to reduce the difference. 
Additionally, blocks may be transformed using 

20 transformations such as a discrete cosine 

transformation (DCT) or a Walsh transformation. Such 
transformations provide a block of coefficients having 
few non-zero values and long runs of zero values. 
Typically, information extracting circuit 110 quantizes 

25 the transformed blocks and converts the transformed 

block into a series of symbols. A symbol may indicate, 
for example, a non-zero value in the block and a number 
of consecutive zero values in a run preceding the non- 
zero value. 

30 A coding block 120 codes symbols for transmission. 

One well known variable length coding technique, 
commonly referred to as Huffman coding, matches each 
symbol to a Huffman code in a table. The table is 
constructed so that the most common symbols have the 

35 Huffman codes with the fewest bits so that on average 
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coding block 120 reduces the number of bits required to 
express a string of symbols. 

A problem in prior art Huffman coding is that the 
bitstream representing a moving image commonly contains 
5 symbols indicating motion vectors, symbols indicating 
image parameters, and symbols indicating the start of a 
frame or block. Coding block 120 typically does not 
efficiently code these symbols because the values of 
the symbols have statistical distributions that differ 

10 from the statistical distributions of symbols 

representing difference blocks- Further, although 
Huffman coding reduces the number of bits in a 
description of a moving image, more efficient symbol 
coding would allow further improvements image quality 

15 at low bit rates. 

Arithmetic coding is known in fields other than 
moving image coding. Arithmetic coding, instead of 
encoding individual symbols as does Huffman coding, 
encodes strings of symbols. Fig, 2 illustrates an 

20 example of arithmetic coding. In the example, the set 
of possible values for each symbol is {a, e, i, o, u, 
!}, and the string encoded is "eaii!". Symbol " ! " 
indicates the end of a string. An arithmetic coding 
model for the symbol divides an interval into segments 

25 which have lengths proportional to the probability of 
the symbol occurring in a string. In Fig. 2, the 
probabilities for symbols a, e, i, o, u, and ! are 20%, 
30%, 10%, 20%, 10%, and 10%; and the model divides the 
interval [0,1) into segments [0,0.2), [0.2,0.5), 

30 [0.5,0.6), [0.6,0.8), [0.8,0.9), and [0.9,1.0) which 
correspond to symbols a, e, i, o, u, and ! 
respectively. 

To determine a code value within interval [0,1), 
the interval [0,1) is divided according to the model, 

35 and a segment [0.2,0.5) corresponding to symbol e, the 
first symbol in string eaii!, is selected. The 
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selected segment [0.2,0.5) is divided according to the 
model into segments [0.2,0.26), [0.26,0.35), 
[0.35,0.38), [0.38,0.44), [0.44,0.47), and [0.47,0,5) 
corresponding to symbols a, e, i, o, u, ! respectively, 
5 and the segment [0.2,0.26) corresponding to symbol a, 
the second symbol of "eaii!", is selected. 
Partitioning and selecting of segments is repeated for 
the remaining symbols i, i, and ! and results in a 
final selection of the interval [0.23354,0.2336) which 

10 corresponds to string "eaii!". Arithmetic coding 

selects from the selected final interval a code value 
that requires the minimum number of bits to express. 
For the example string of Fig. 2, code value 
0.233581542 is 0.0011101111011 binary. 

15 The prior art has not provided an efficient way 

employ arithmetic coding in moving image encoding where 
strings containing differing Icinds of symbols have 
different ranges of possible values ^and different 
statistics. 

20 

SUMMARY QP THE INVENTION 

In accordance with an embodiment of this 
invention, a process for variable length coding a 
string of symbol values employs a first arithmetic 

25 coding model for a first symbol value from the string 
and a second arithmetic coding model, which differs 
from the first, for a second symbol value from the 
string. Successive intervals are partitioned according 
to different models which are derived from different 

30 statistical distributions. The correct model for a 

symbol value is determined from a predefined syntax for 
the string. Accordingly, syntax-based adaptive 
arithmetic encoding in accordance with this invention 
can efficiently encode symbol values not encoded in the 

35 prior art and can efficiently combine symbols having 
different statistics into a single code. Encoding 
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according to this embodiment is more efficient than 
prior art arithmetic coding in that the average length 
of codes generated in accordance with this invention is 
shorter than the average code length generated using a 
5 single arithmetic coding model for all values in a 
string . 

One particular embodiment of the invention employs 
multiple arithmetic coding models for encoding symbols 
representing a discrete cosine transform (DCT) of a 

10 difference block. Symbols representing the top-left 
coefficients of the DCT of the difference block have 
different statistics and a matching arithmetic coding 
model, and a different arithmetic coding model is used 
for coefficients further from the top- left corner, 

15 Another embodiment of the invention is a 

videophone including an information extracting block 
which generates a series of symbol values representing 
a moving image, and a coding block which encodes 
strings of symbol values from the series using a 

20 syntax-base adaptive arithmetic coding process. 

Typically, the coding block includes: a memory which 
stores a plurality of arithmetic coding models; a 
syntax decoder which selects from the memory a model 
which, according to a predetermined syntax for the 

25 series, corresponds to a symbol value in the series; an 
arithmetic coder which encodes each symbol value from a 
string into a code value according to the corresponding 
arithmetic coding model selected by the syntax decoder. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a block diagram of a video encoding 
portion of a videophone. 

Fig. 2 illustrates arithmetic coding of a string 
of symbols. 

35 Fig. 3 illustrates an example of a syntax for a 

series of symbols describing a moving image. 
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Fig, 4 is a block diagram of video encoding and 
decoding portions of a pair of videophones in 
accordance with an embodiment of this invention. 

Fig, 5 shows an order in which coefficients of a 
5 block are evaluated during generation of symbols 
representing the block. 

Fig. 6 illustrates a process of encoding and 
decoding a string of symbols using adaptive arithmetic 
coding in accordance with an embodiment of this 
10 invention. 

Figs- 7A and 7B show block diagrams of syntax- 
based encoding and decoding units in accordance with an 
embodiment of this invention. 

Figs. 8A and 8B illustrate alternative syntaxes 
15 for arithmetic encoding in accordance with this 
invention. 

Use of the same reference symbols in different 
figures indicates similar or identical items. 

20 DETAILED DESCRIPTION OF THE PREFERRBD EM BQDIMEIITS 

In accordance with an embodiment of the invention, 
a coding block performs arithmetic coding using 
different arithmetic coding models for different 
symbols in a message. The model being used changes 

25 according to the syntax of the message being coded. 
For example, the syntax of the message indicates the 
meaning of each symbol in the message, and the 
statistical distribution of values for a symbol 
typically depends on meaning of the symbol. A 

30 different arithmetic coding model selected according to 
the syntax of the message may be used for each symbol • 
Syntax-based arithmetic coding in accordance with the 
invention produces a single code value determined from 
a string of symbols and multiple models. 

35 Symbol values of the same type can also be coded 

using different models. For example, a string of 
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symbols representing coefficients of a transformed 
difference block can be arithmetically coded using a 
first arithmetic coding model for the first symbol and 
a second arithmetic coding model for the second symbol. 
5 The model used for a symbol is determined by the order 
or position of the symbol in the message, and the model 
can be based on statistics for a symbol in that 
position. This is useful for example because a first 
symbol often indicates a large coefficient and a short 
10 run of zeros while following coefficients indicate 

smaller coefficients with longer runs of zeros. If the 
syntax of a message defines a different order for the 
symbols, the order in which the arithmetic models are 
used changes to match the syntcix. 
15 Pig. 3 illustrates an example of a syntax for a 

representation of a moving image. This syntax is 
described in NTR "Video Codec Test Model, TMN4 Revl", 
IW Telecommunication Standa rdization Sector, study 
group 15. Working Party IS/l. LBC-94-, October 25, 1994 
20 which is incorporated by reference here in its 

entirety. Of particular relevance to this disclosure 
is that the syntax shown in Fig. 3 contains many 
symbols PSC, TR, PTYPE, QUANT, PEI, SPARE, GBSC, GN, 
COD, CBPCM, CBPy, DQUANT, MV, INTRADC, and TCOF, each 
25 of which has a different meaning and a different 
statistical distribution of values. 

For the syntax of Fig. 3, three two-dimensional 
arrays, one for each color component U, and V, 
represent a current frame. If the current frame 
30 differs greatly from a previous frame, the current 

frame is referred to as an intra-frame, and each array 
is divided into 16-by-16 pixel blocks referred to as an 
intra-block. 

If the current frame is sufficiently similar to a 
35 previous frame, the current frame is referred to as an 
inter- frame, and each array is divided into 16 -by- 16 
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pixel blocks. For inter -frames, motion estimation 
determines motion vectors which indicate the blocks in 
the previous frame which closely match the blocks in 
the current frame • Inter-blocks are the difference 
5 between a block in the current the frame and a closely 
matching block in the previous frame- If a block in 
the current inter- frame has no closely matching block 
in the previous frame, an intra-block is used which 
contains the pixel values of the block rather theun a 

10 difference. 

Different color components Y, U, V are treated 
differently. 16 -by- 16 blocks for an intensity 
component Y are handled as four 8 -by- 8 blocks. Each 
16 -by- 16 block for color components U and V is 

15 converted to a single 8 -by- 8 block by combining a set 
of four pixel values in the 16 -by- 16 block to generate 
one value in the 8-by-8 block. This reduces the amount 
of data at the expense of losing information that is 
less important to picture quality- A macro block is a 

20 set of six 8-by-8 blocks, four for the Y component, one 
for the U conqponent, and one for the V component. 

Each frame of a moving image is indicated by a 
picture layer message 310 that starts with a symbol PSC 
having the value of a picture start code, for example 

25 1000 0000 0000 0001 0000 binary. Following symbol PSC 
is a symbol TR which is a time reference indicating the 
position of the frame in the succession of frames. A 
picture type symbol PTYPE follows symbol TR and 
indicates frame parameters such as frame size and 

30 whether the frame is an intra- frame or an inter- frame. 
Symbol QUANT follows symbol PTYPE and indicates the 
step size used for quantizing DCT coefficients. 

Symbol PEI is an extra information flag bit. An 
8 -bit symbol SPARE containing user defined information 

35 follows symbol PEI when symbol PEI has value 1. 

Following the symbol SPARE is another symbol PEI which 
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indicates if there is another SPARE symbol. After the 
first symbol PEI which equals zero is a GOB layer 320 
message which describes a group of blocks in the frame. 
In GOB layer, symbols GBSC and GN are the start 
5 code for a row of macroblock and a number identifying 
the macroblock row as described in the above 
incorporated reference. Symbols GBSC and GN are 
optional and may be skipped by proceeding directly to a 
macroblock layer 330. 

10 MB layer 330 begins with a symbol COD which 

indicates whether the message contains a description of 
a group of macroblocks. If a group is not described, 
the remaining symbols in the macroblock layer are 
skipped. Symbols CBPCM and CBPY indicate which of the 

15 8 -by- 8 color component blocks in a macroblock are 

described and whether each color component block is an 
intra-block or an inter-block. Symbols DQUANT and MV 
indicate the quantization technique, a step size 
change, and x and y component change for the motion 

20 vector corresponding to the macroblock. Symbol INTRADC 
represents the DC components of a discrete cosine 
transformation (DCT) of an intra-block and is skipped 
for inter-blocks. Symbol TCOF is repeated to provide a 
series of values representing the remaining 

25 coefficients of an intra-block or all of the 
coefficients of an inter-block. 

The syntax illustrated in Fig. 3 is only an 
example of one message syntax in which an embodiment of 
the invention may be employed. As will be apparent 

30 from this disclosure, other embodiments of the 

invention may be employed in an almost limitless 
variety of other message syntaxes. 

Fig. 4 shows a video encoding portion 470 of a 
first videophone connected via a channel 490 to a video 

35 decoding portion 480 of a second videophone. Channel 
490 is a relatively low bit rate channel (less than 
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28,8 kbit/s) such as provided by a standard phone line 
using modems. The actual available bit rate for video 
transmission over channel 490 could be less (about 8 to 
20 kbit/s) because some of the bandwidth of channel 490 
5 is for audio transmission. Video encoding portion 470 
receives an input signal VIDEO_IN from devices (not 
shown) such as a CCD (charge coupled device) camera and 
a digitizer. Signal VIDEO_IN includes a series of 
digital values which form two-dimensional arrays of 
10 pixel values for frames of a moving image- The pixel 
values may for example indicate intensity for a 
grayscale image or color components such as standard 
RGB or YUV components. The example of pixel values 
that are YUV color components is described herein. 
15 Information extracting unit 410 processes the two- 

dimensional arrays of pixel values to generate a 
message that requires fewer bits to describe the moving 
image. Information extracting unit 410 can be 
implemented in software executed on a general purpose 
20 digital signal processor such as a computer which 

receives digital input signal VIDEO_IN and stores and 
processes the digital values. Alternatively, 
information extracting block may be implemented in 
hardware or a combination of customized hardware and 
25 firmware. One embodiment of information extracting 
unit 410 is the source coder described by X. Ran, 
"Draft Recommendation H.26P (TMN4Revl changes included) 
(Video coding for narrow telecommunication channels at 
< 64 kbit/s)", ITU Telecommunication Standardization 
30 Sector, Study Group 15, Working Party 15/1, Expert's 
Group LBC-94-, October 1994 which is incorporated by 
reference here in its entirety. 

Information extracting unit 410 performs a 
discrete cosine transformation (DCT) on inter-blocks 
35 and intra -blocks and then quantizes the DCT 

coefficients, for example by dividing each coefficient 
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by a fixed step size and rounding the result to a fixed 
number of bits indicating a level. 

For each 16 -by- 16 block in the current frame, 
information extracting unit 410 converts six 8-by-B 
5 blocks of a macroblock into six series of symbols. The 
64 coefficient of a 8 -by- 8 block are considered in a 
zig-zag order shown in Fig. 5 when determining a series 
of symbols for the block. Each symbol indicates one 
non-zero coefficient and contains a bit which indicates 
10 if the symbol is the last in the series, bits which 
indicate the length of a run of zero coefficients 
occurring before the non-zero coefficient, and bits 
indicating the level of the non-zero coefficient. In 
the syntax of Fig, 3, a series of symbols representing 
15 an intra-block is one symbol INTRADC followed by 

repeated occurrences of symbol TCOF, and a series of 
symbols representing an inter-block is repeated 
occurrences of symbol TCOF. 

Information extracting unit 410 of Fig. 4 provides 
20 to a coding unit 420 the symbols indicated in Fig. 3 in 
the order defined by Fig. 3. Coding unit 420 consists 
of four layers, a syntax-based adaptive arithmetic 
coder (SAAC) layer 422, an arithmetic coding layer 424, 
a picture start code (PSC) layer 426, and a bitstream 
25 layer 428. Each layer may be implemented in hardware, 
software, firmware, or a combination of the three. 
Fig. 7A shows a block diagram of a hardware embodiment 
of coding unit 420. 

SAAC layer 422 and arithmetic coding layer 424 
30 perform syntax-based adaptive arithmetic coding in 

accordance with an embodiment of the invention. SAAC 
layer 422 receives a symbol value from information 
extracting unit 410 and selects an arithmetic coding 
model that arithmetic layer 424 uses for the symbol. 
35 SAAC layer 422 selects the correct model for a symbol 

according to the syntax of the message. For the syntax 
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of Fig. 3, each symbol value received by SAAC 
corresponds to one of symbols PSC, TR, PTYPE, QUANT, 
PEI, SPARE, GBSC, GN, COD, CBPCM, CBPY, DQUANT, MV, 
INTRADC, and TCOF. SAAC layer 422 selects different 
5 arithmetic models for symbols COD, CBPCM, CBPY, DQUANT, 
MV, INTRADC, and TCOF and directs arithmetic coding 
layer 424 to encode each of those symbols as part of a 
single code. Symbols PSC, TR, PTYPE, QUANT, PEI, 
SPARE, GBSC, GN are fixed length coded by routing the 
10 symbols past arithmetic coding layer 425 to a PSC layer 
described below. 

Arithmetic coding layer 424 encodes each symbol 
using the corresponding arithmetic coding model 
selected by SAAC layer 422. Fig. 6 illustrates an 
15 example of syntax-based adaptive coding where 

arithmetic coding layer 422 receives a string of 
symbols having values aSaai. The first, third, and 
fifth symbol values (a, a, and !) are for symbols which 
the syntax indicates has six possible values {a, e, i, 
20 o, u, l}. The arithmetic coding model for the first, 
third, and fifth symbol is based on statistical 
probabilities of each value, 20%, 30%, 10%, 20%, 10%, 
and 10% for symbol values a, e, i, o, u, and 1 
respectively. The second and fourth symbol value are 
25 for symbols having two possible values {a, S} where 
both values have the same probability 50%. 

Encoding the string a&aa! as a code value in the 
interval [0,1) proceeds as follows. SAAC layer 422 
receives the first symbol value a from information 
30 extracting unit 410 and selects from memory the model 
indicating probabilities 20%, 30%, 10%, 20%, 10%, and 
10%. Arithmetic coding layer 424 divides the interval 
[0,1) into segments [0,0.2), [0.2,0.5), [0.5.0.6), 
[0.6,0.8), [0.8,0.9), and [0.9,1) according to the 
35 model, and then selects interval [0,0.2), the interval 
corresponding to symbol value a. The final code value 
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has the form O.OOx because the code value is in the 
interval [0,0.2) and therefore less than 0.01 binary. 
Arithmetic coding layer 424 transmits two bits 00 to 
PSC layer 426 indicating the first two bits in the code 
5 value . 

For the second symbol value S, SAAC layer 422 
selects the model indicating probabilities 50% and 50%. 
Arithmetic layer divides the selected interval [0,0.2) 
into segments [0,0.1) and (0.1,0.2) according to the 

10 model from SAAC layer 422 and then selects interval 

[0.1,0.2), the interval corresponding to symbol value 
IS. No new bits can be sent to PSC layer 426 because 
the interval [0.1,0,2) contains values greater and less 
than 0.001 binary. Interval (0.1,0.2) is divided 

15 according to the model for the third symbol, and 

interval (0.1,0.12) is selected according to the symbol 
value a. Bits Oil are sent to PSC layer 426 indicating 
the code will be less than 0.001 binary but greater 
than 0.00011 binary. Interval [0.1,0.12) is divided 

20 according to the model for the fourth symbol, and 

interval (0.1,0.11) is selected for symbol value a. No 
bit is sent to PSC layer 426 because 0.000111 binary is 
in the interval (0.1,0.11). Finally, interval 
[0.1,0.12) is divided according to the model for the 

25 fifth symbol and interval (0.109,0.11) is selected for 
symbol value I . Bit 1 is sent to indicate a code value 
0.109375 which is 0.000111 binary. For the example of 
Fig. 4, arithmetic coding layer 424 transmits six bits 
000111 to PSC layer 426 in order most significant to 

30 least significant bits and uniquely represents the 
string of symbol values afSaal. 

As an interval is successively, the end points of 
the selected intervals must be known to higher and 
higher precision, but the memory locations which store 

35 the end points have precision limited to a fixed number 
of bits. However, after transmission of a bit from 
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arithmetic coding layer 424, the most significant bit 
of both end points must be equal and are not required 
for further coding. Accordingly, the most significant 
bit can be shifted out of the memory locations to 
5 provide room for a least significant bit and provide 
higher precision. When a selected interval has an 
upper end point above 0.1 binary but below 0.11 binary 
and has a lower end point below 0.1 binary but above 
0.01 binary, arithmetic coding layer 424 attempts to 

10 determine two or more code bits to be sent. 

PSC layer 426 is basically a FIFO buffer which 
receives bits from arithmetic coding layer 424 and SAAC 
layer 422 and checks for a bit secjuence which could 
give a false PSC (picture start code) . If such a bit 

15 sequence occurs, an extra bit is stuffed into the 

sequence. For example, if the PSC is 1000 0000 0000 
0001 0000 and sequence 1000 0000 0000 0001 000 occurs 
in PSC layer 426, PSC layer 426 checks a flag signal 
from SAAC layer 422 to determine if a PSC is being 

20 transmitted. If a PSC is not being transmitted, PSC 
layer 426 changes a 19-bit series 1000 0000 0000 0001 
000 to a 20-bit series 1000 0000 0000 0001 0001. 

Bitstream layer 428 moves bits from PSC layer 426 
to an output channel buffer 430. Bit from output 

25 channel buffer 430 are transmitted across channel 4 90 
to an input channel buffer 440 in decoding portion 480 
of the second video phone. A bitstream layer 458 in a 
decoding unit 450 moves bits from input channel buffer 
440 into a PSC layer 456. PSC layer 456 changes a 20- 

30 bit series 1000 0000 0000 0001 0001 to a 19-bit series 
1000 0000 0000 0001 000. 

An arithmetic decoding layer 454 in decoding unit 
450 receives bits from PSC 456 and decodes the bits 
according to models provided by an SAAC layer 452. For 

35 example, SAAC layer 422 selects a first model for 

arithmetic decoding layer 454, and arithmetic decoding 
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layer 454 retrieves a number of bits from PSC layer 456 
to determine which interval of the model is indicated 
by the code. Arithmetic decoding layer 454 then sends 
a corresponding symbol to SAAC layer 452. SAAC layer 
5 452 sends the decoded symbol value to a video 

reconstructing unit 460 « and selects the arithmetic 
coding model arithmetic decoding layer 454 uses when 
decoding the next symbol. Video reconstructing unit 
460 receives decoded symbols from decoding unit 450 and 

10 performs the inverse of the operation performed by 
information extracting unit 410. 

Fig. 6 can also be used to illustrate the syntax- 
based arithmetic decoding process. Initially, SAAC 
layer 452 selects a model for the first symbol. 

15 Arithmetic decoding layer 454 receives the first bit 
which is zero and determines that the first symbol 
value is either a or e, the symbol values in the bottom 
half of interval [0,1). Arithmetic decoding layer 454 
receives the second bit, which is also zero, and 

20 determines that the first symbol value is a. SAAC 
layer receives the value a and selects the next 
arithmetic coding model for partitioning the interval 
[0,0.2) . Arithmetic decoding layer receives the third 
bit which is 1, and determines the code value is in the 

25 top half of interval [0,0.2) therefore the second 

symbol is fi. SAAC layer 452 selects the next model 630 
based on the preceding symbol values a and 6. Decoding 
proceeds a bit at a time in this fashion until SAAC 
determines I is the last symbol in the string. 

30 Fig. 7A shows a block diagram of a hardware 

embodiment of encoding unit 420. In encoding unit 420, 
SAAC layer 422 includes a finite state machine (FSM) 
710 which monitors symbols from information extracting 
unit 410 and generates signals indicating a state 

35 defined by the syntax of the message being encoded. 

Such state machines are well known in the art and may 
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be a part of SAAC layer as shovm in Fig. 7A. 
Alternatively, the functions of FSM 710 may be combined 
with the function of a state machine in information 
extracting unit 410. An output signal from FSM 710 
5 selects address of memory location in memory 730 

containing values which define an arithmetic coding 
model. FSM 710 also controls a multiplexer 720. 
Symbols to be encoded are routed through multiplexer 
720 to arithmetic coding logic 740 in arithmetic coding 

10 layer 424. Arithmetic coding logic 740 uses the symbol 
value and the model from memory 730 to generate bits of 
a code value which are written to a FIFO buffer 750. 
Symbols that are not to be encoded by arithmetic coding 
layer 424 are routed through multiplexer 720 directly 

15 to FIFO buffer 750. From FIFO buffer 760, bits are fed 
into bitstream 428 and channel 490. 

Fig. 7B shows a block diagram of a decoding unit 
450. Decoding unit 450 contains a FIFO buffer 790 
which receives bits from bit stream 458. Bits are read 

20 from FIFO buffer 790 pass through a multiplexer 780 
which- is controlled by a finite state machine (FSM) 
760. Bits which make up fixed length sv-nbols are 
routed through multiplexer 780 to recc* structing unit 
460. Bits to be decoded are routed t. arithmetic 

25 decoding logic 770. Arithmetic coding logic 770 

converts bits from the bitstream into symbols using an 
arithmetic coding model stored in a memory 765. FSM 
760 selects the appropriate arithmetic coding model 
according to the previously symbol value either from 

30 arithmetic coder 770 or FIFO 790, 

Fig. 6 provides an illustrative example of syntax- 
based arithmetic coding of a string. For actual 
symbols such as defined by the syntax of Fig. 3, the 
numbers of possible values and statistical 

35 , distributions will differ from those given in the 
example of Fig. 6. The actual statistical 
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distributions of a symbol can be determined by 
observing the values of the symbol during encoding of 
video images. 

Arithmetic coding models for symbols to be encoded 
5 can be generated off-line and stored in memory before 
operation of a videophone. However, if desired, models 
can be updated, by counting the occurrences of each 
value for a symbol during operation. The counts can be 
occasionally normalized, for example by dividing each 
10 count by a fixed value, to stop any count from 
exceeding the capacity of its storage location. 
Updating the models during operation allows the models 
improve for the particular use of the videophone and 
causes coding become more efficient for the particular 
15 application. 

In addition to providing models corresponding to 
different symbols, multiple models can be used for some 
symbols « For example, some symbols such as symbol TCOF 
in the syntax of Fig. 3 occur many times in a typical 
message, and an arithmetic coding model can be selected 
not only according to the symbol but also according to 
the order of occurrence of the symbol in the message. 

Fig. 8A shows an expanded version of block layer 
340 from the syntax illustrated in Fig. 3. A loop 344 
which repeats symbol TCOF in Fig. 3 is replaced by 
symbols TCOFl, TC0F2, and TC0F3 (which are first three 
occurrences of symbol TCOF in loop 344) and a loop 844 
which repeats symbol TCOFR as many times as is required 
to represent the DCT coefficients of an 8 -by- 8 block. 
Each symbol TCOFl, TC0F2, TC0F3, and TCOFR has a value 
which contains a bit indicating if the symbol is the 
last in the series, bits indicating the length of a run 
of zero values occurring before a non-zero coefficient, 
and bits indicating the level of the non-zero 
coefficient. DCT transformations commonly concentrate 
non-zero coefficients in top-left corner of a block of 
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coefficients. Additionally, the non-zero coefficients 
near the top- left of a block tend to be larger than the 
other coefficients in the block. Thus, symbol TCOFl 
representing the first non-zero coefficient found along 
5 the order shown in Fig. 5 tends to have a high level 
and a low run of zero coefficients. In contrast, a 
non-zero coefficient encoded later according to the 
order given in Fig. 5 tends to be preceded by longer 
runs of zero valued coefficients and tends to have a 
10 smaller level. To account for the different 

statistical distributions of values for symbols TCOFl, 
TCX)F2, TC0F3, and TCOFR, different models can be used 
for each of symbols TCOFl, TC0F2, TC0F3, and TCOFR. 
Additionally, models can be distinguished according to 
15 whether the DCT coefficient described are for an intra- 
block or an inter-block. 

The process of Fig. 8A uses four different 
arithmetic coding models for different occurrences of 
symbol TCOF of Fig. 3. Alternatively, more or fewer 
20 models can be used for symbol TCOF. Adding models 
improves the jefficiency of coding but increases the 
memory required to store the models. Testing has 
indicated that the improvement in coding efficiency per 
amount of memory drops sharply beyond four arithmetic 
25 coding models. 

Fig. 8B shows another expanded version of block 
layer 340 from the syntax illustrated in Fig. 3. In 
Fig. 8B, loop 344 repeating symbol TCOF is divided into 
symbols TCOFTl, TCOFTR, TCOFBl, TC0BF2, TC0FB3, and 
30 TCOFBR, each of which has a corresponding arithmetic 
coding model. The symbols TCOF are determined in the 
manner as described above and contain run lengths of 
zero values along the order shown in Fig. 5, but the 
message syntax of Fig. SB sends the symbols in reverse 
35 order. Symbol TCOFTl contains a level for the last 

non-zero DCT coefficient found along the order given in 
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Fig. 5. Accordingly, symbol TCOFTl corresponds to the 
tail of the series of symbols found using the order of 
Pig. 5, 

Symbol TCOFTR which is repeated in loop 846 is 
5 also near the tail of the order. Loop 846 repeats 
symbol TCOFTR until the level indicated by a value of 
symbol TCOFTR rising above a threshold level. If the 
level indicated is greater than the threshold value, 
the next symbol is TCOFBl. This separation of 

10 arithmetic models according to level is found to 
provide very efficient coding. After the level 
indicated reaches the threshold, four more arithmetic 
models are used. The final model for symbol TCOFBR is 
repeated as many times as necessary if there are more 

15 than six symbol describing the DCT coefficients of a 
block . 

Although the present invention has been described 
with reference to particular embodiments, the 
description is only an example of the invention's 

20 application and should not be taken as a limitation. 
Embodiments of this invention can also be implemented 
in software. Various other adaptations and 
combinations of features of the embodiments disclosed 
will be apparent to those skilled in the art and are 

25 within the scope of the present invention as defined by 
the following claims. 
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I claim: 

1. A method for encoding a string of symbol 
values, comprising: 

selecting an interval to contain a code 
5 valued- 
selecting a first arithmetic coding model, 
wherein the first arithmetic coding model is 
associated with a first symbol in a syntax for the 
string and partitions the interval into a set of 
10 first segments that correspond one-to-one with 

possible values for the first symbols- 
changing the interval to contain the code so 
that the interval is equal to the one of the first 
segments which corresponds to a first symbol value 
15 in the string; 

selecting a second arithmetic coding model, 
wherein the second arithmetic coding model is 
associated with a second symbol in the syntax for 
the string and partitions the interval into a set 
20 of second segments that correspond one-to-one with 

possible values for the second symbol; and 

generating a code value which is contained in 
the one of the second segments that corresponds to 
a second symbol value in the string. 

25 

2. The method of claim 1, wherein the syntax for 
the string defines symbol types for symbols, and each 
symbol type indicates a range of possible symbol values 
for a symbol. 

30 

3. The method of claim 2, wherein the first 
symbol and the second symbol are of the same symbol 
type. 

35 4. The method of claim 3, wherein each of the 

first segments has a length proportional to the 
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Statistical probability of the first symbol having the 
symbol value corresponding to the segment, each of the 
second segments has a length proportional to the 
statistical probability of the second symbol having the 
5 symbol value corresponding to the segment, and 

differences between the first arithmetic model and the 
second arithmetic model reflect differences between 
statistical probabilities of values for the first 
symbol and statistical probabilities of values for the 
10 second symbol. 

5. The method of claim 1, wherein for each 
symbol, the method further comprises: 

selecting an arithmetic coding model which is 
15 associated with the symbol and partitions the 

interval into a set of segments that correspond 
one-to-one with possible values for the symbol, 
wherein the first arithmetic coding model is 
associated with the first symbol, and the second 
20 arithmetic model value is associated with the 

second symbol; and 

changing the interval so that the interval is 
equal to the segment which, according to the 
selected arithmetic coding model, corresponds the 
25 symbol's value from the string. 

6. The method of claim l, wherein each of the 
first segments has a length proportional to the 
statistical probability of the first symbol having the 

30 symbol value corresponding to the segment, and each of 
the second segments has a length proportional to the 
statistical probability of the second symbol having the 
symbol value corresponding to the segment . 
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7. The method of claim 1, further comprising: 

performing a discrete cosine transform on a 
block of values representing an area in a moving 
image, wherein the discrete cosine transform 
5 generates a block of transform coefficients; and 

generating the string of symbol values from 
the block of transform coefficients. 

8. The method of claim 7, wherein the block of 
10 values comprises pixel values for the area in a frame 

of the moving image. 

9. The method of claim 7, wherein the block of 
values represents a difference between the area in a 

15 first frame of the moving image and the area in a 
second frame of the moving image. 

10. The method of claim 7, wherein the first 
symbol and the second symbol both represent trans fonn 

20 coefficient • 

11. The method of claim 10, wherein the first 
symbol and the second symbol are symbols of the same 
type, and each of the first and second symbols contains 

25 bits which indicate a length for a run of zero 

transform coefficients occurring before a non-zero 
transform coefficient and bits indicating a level for 
the non-zero transform coefficient. 

30 12. A method for generating a coded bit stream 

representing values for symbols in a string, the method 
comprising: 

storing a first binary value and a second 
binary value in a memory, wherein the first and 
35 second values represent boundaries of an interval; 

and 
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for each symbol given by a syntax for the 
string: 

determining, from the syntax, an arithmetic 
coding model for that symbol; 

changing the first and second binary values 
to indicate new boundaries for the interval, 
wherein the new boundaries depend on old 
boundaries for the interval, the arithmetic model 
for that symbol, and a value provided for the 
symbol by the string; and 

in response to every value within the 
interval having the same most significant bit, 
transmitting that most significant bit and 
shifting the first and second binary values to 
remove the most significant bit. 

13. A method for decoding a bit stream 
representing a string of symbol values, the method 
comprising: 

storing bits from the bit stream in a FIFO 
buffer; 

storing a first binary value and a second 
binary value in a memory; 

selecting an arithmetic coding model for a 
first symbol in the string; 

determining, from the selected arithmetic 
model, boundaries of segments which constitute a 
partition of an interval bounded by the first and 
second binary values, wherein the segments are in 
one-to-one correspondence with possible symbol 
values; 

detecting that a selected one of the segments 
contains all binary values which have most 
significant bits equal to one or more oldest bits 
from the FIFO buffer; and 
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determining the symbol value that corresponds 
to the selected segment. 

14. The method of claim 13, further comprising: 
5 removing from the FIFO buffer a least number 

of oldest bits for which all possible binary 

values that begin with those oldest bits are 

within the selected segments- 
shifting the first and second binary value by 
10 the number of bits removed from the FIFO buffers- 

selecting an arithmetic coding model for the 

next symbol in the string; and 

repeating the last three steps of claim 13. 

15 15. An apparatus for transmitting a video image, 

comprising: 

an information extracting block which 
generates a series of symbol values representing a 
video image; and 
20 a coding block operably coupled to receive 

the series of symbol values from the information 
extracting block, wherein the coding block 
comprises: 

a memory containing information 
25 describing a first arithmetic coding model 

and a second arithmetic coding model; 

a state machine operably connected to 
the memory, wherein the state machine selects 
the first model in response to receiving a 
30 symbol value corresponding to a first symbol 

in a syntax for the series from the 
extracting block and selects the second model 
in response to receiving a symbol value 
corresponding to a second symbol in the 
35 syntax; and 
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an arithmetic coder operably connected 
to the memory and the information extracting 
block, wherein the arithmetic coder uses the 
information which describes the model 
selected by the state machine for a symbol 
when the arithmetic coder encodes the 
symbol's value in an arithmetic code. 



16. The apparatus of claim 15, wherein the 
10 information describing the first arithmetic coding 

model indicates statistical probabilities of symbol 
values of the first symbol. 

17. The apparatus of claim 16, wherein arithmetic 
15 coder comprises: 

a second memory for storing a first binary 
value and a second binary, wherein the first and 
second values represent boundaries of a first 
interval ; 

20 means for changing the first and second 

binary values to indicate boundaries of a second 
interval, wherein the boundaries of the second 
interval depend on the first binary value, the 
second binary value, the arithmetic model selected 

25 by the state machine, and a symbol value to be 

encoded; and 

means for generating a code bit when the most 
significant bit of every value in the second 
interval is the same. 



30 



18. The apparatus of claim 17, wherein the 
arithmetic coder further comprises means for shifting 
the most significant bit out of the first and second 
value when a code bit is generated. 



35 
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19. The apparatus of claim 16, wherein the coding 
block further comprises means for updating the 
statistical probabilities according to occurrences of 
symbol values during encoding. 

5 

20. The apparatus of claim 15, wherein the coding 
block further comprises multiplexer controlled by the 
state machine, wherein the multiplexer receives the 
symbol values from the extracting block and either 

10 provides a symbol value to the arithmetic coder or 
bypasses the arithmetic coder, wherein for a third 
symbol, the state machine causes the multiplexer to 
bypass the arithmetic coder. 
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